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FORECASTING TRENDS WITH ASSET PRICES 


Ahmed Bel Hadj Ayed^’^, Gregoire Loeper Frederic Abergel ^ 

Abstract. In this paper, we consider a stochastic asset price 
model where the trend is an unobservable Ornstein Uhlenbeck pro¬ 
cess. We first review some classical results from Kalman filtering. 
Expectedly, the choice of the parameters is crucial to put it into 
practice. For this purpose, we obtain the likelihood in closed form, 
and provide two on-line computations of this function. Then, we 
investigate the asymptotic behaviour of statistical estimators. Fi¬ 
nally, we quantify the effect of a bad calibration with the contin¬ 
uous time mis-specified Kalman hlter. Numerical examples illus¬ 
trate the difficulty of trend forecasting in financial time series. 


Motivations 

Asset prices are well described by random walks. The underlying 
economic foundation, introduced in 1900 by Louis Bachelier (see 0), 
is that, in an efficient market, price changes reflect new information 
and hence approximate a random variation. The hrst analytical and 
realistic approach was proposed in 1959 by Osborne (see [2H]), who 
modelled returns as an ordinary random walk. In this case, future 
returns are not predictable. Nevertheless, people have divergent views 
on the subject and trend following strategies are the principal sources 
of returns for Commodity Trading Advisors (see ISl). Furthermore, 
most quantitative strategies are based on the assumption of trends 
extracted from the asset prices (see Bi.ra). This component contains 
information about global changes and is useful for prediction. This 
estimation is a very difficult problem because of a high measurement 
noise. Consider for example the simple model with a constant trend 
^ = fidt -h asdWf: the best trend estimate at time t is pt = y 
The Student’s t-test will reject the hypothesis /i = 0 at time T if 
|/ir| > with Qa > I {qa = 1-96 for a signihcance level equal 

to a = 5%). For instance, with as = 30%, the estimate Jit = 1% 
is statistically relevant if T > qI 900 years. Generally, unobserved 
stochastic trends are considered and hltering methods are used (see 
m, [30],[20] or [H). Most of these filters introduce a parametric trend 
model. Therefore, these methods are confronted to the choice of the 
parameters. Based on the Bayesian approach (the Kalman hlter, the 
Extended Kalman hlter or the Particle hlter for example) or on the 
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maximum likelihood estimation (see | 22 j . 12a, 0, 0 or m), several 
studies on the inference of hidden processes have been done in the past. 
Many researchers applied these methods on hnancial time series, but 
they often focus on stochastic volatility processes (see [H] , [13] , [IE] or 


uni). 


The purpose of this work is to assess the feasibility of forecasting 
trends modeled by an unobserved mean-reverting diffusion. 

The paper is organized as follows: the hrst section presents the model 
and recalls some results from Kalman hltering. 

The second section is devoted to the inference of the parameters 
with discrete time observations. Since all statistical estimators are 
based on the likelihood, two recursive computations of this function 
are presented. Based on the Valentine Genon-Catalot’s results (see [TE] 
for details), the performance of statistical estimators is evaluated by 
giving their asymptotic behaviours, and by providing, in closed form, 
the Cramer Rao bound. 

In the third section, we introduce the continuous time mis-specihed 
Kalman hlter (like in [25], where the authors consider the case of an 
asset whose trend changes at unknown random times), which takes 
into account a bad calibration of the parameters. First, the impact of 
parameters mis-specihcation on trend hltering is quantihed with the 
law of the residuals between the hlter (mis-specihed or not) and the 
hidden process. Then, we derive the probability to have a positive 
trend, knowing a positive estimate. Due to the non-zero correlation 
between the trend and the hlter, this probability is always superior to 


0.5. 


Finally, numerical examples illustrate the difficulty of this calibration 
and show that the impact of parameters mis-specihcation on trend 
hltering is not negligible. 


1. Framework 


This section begins by introducing the model, which corresponds to 
an unobserved mean-reverting dihusion. After discretizing the model, 
we recall the Kalman hltering method. 

1.1. Model. 

1.1.1. Continuous time model. Consider a hnancial market living on a 

stochastic basis (D, F,P), where F = {J^t} is a hltration on D, and 

P is the objective probability measure. Suppose that the dynamics of 
the risky asset S is given by: 



St 


( 1 ) 

( 2 ) 


dfit = -XiM/ntdt + a^dWj^, 
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with and two uncorrelated Wiener processes and po = 0. We 
also assume that (A^, cr^, ( 75 ) G x x and that the trend /i 
and the Brownian motion are F-adapted. 

Remark 1.1. Let he the augmented filtration generated by 

the price process S. Only -adapted processes are observable, which 
implies that agents in this market do not observe the trend fi. 


1.1.2. Discrete time model. Let 5 be a discrete time step. To simplify 
the notation, k is used for tk = kS. The discrete time model is: 


Vk+i 


Sk+i — Sk 

SJ 


h'k+l + Uk+l, 

e + Vk, 


(3) 

(4) 


where Uk ^ M ^0, ^ j and Ufc ~ W ^0, ^ (l — e 'ppg system 


(3)-(4) corresponds to an AR(1) plus noise model. 


1.2. Optimal trend estimator. 


1.2.1. Discrete Kalman filter. 

(A^ 


(T„) and as are supposed to be known 


In this sub section, the parameters 9 = 
The discrete time system 
(3)-(4) corresponds to a Linear Gaussian Space State model where the 
observation is y and the state fv (see [5] for details). In this case, the 
Kalman Liter gives the optimal estimator, which corresponds to the 
conditional expectation K[frk\yi, ■■■,yk]- In the following, to simplify 
the notation, X^/i represents K[Xk\yi, ...,yi]. Appendix A presents a 
detailed introduction to the discrete Kalman Liter. This Liter contains 
two distinct phases: 


( 1 ) 

( 2 ) 


An a priori estimate given fik+i/k and T^+i/fc = E [(/ifc+i — 
fik+i/k){k‘k+i — • This estimate is done using the tran¬ 


sition equation (4) 


An a posteriori estimate. When the new observation is avail¬ 


able, a correction of the Lrst estimate is done to obtain /ifc_|_i/fc_|_i 

and Vk+i/k+i = E [(pfc+i — fik+i/k+i){k-k+i — fik+i/k+if'\- The 
criterion for this correction is the least squares method. 


Thus, fik/k is fhe minimum variance linear unbiased estimate of the 
trend To this end, it is the optimal recursive state estimate for a 
Linear Gaussian model. Formally, the iterative method is given by: 


Ffc+l/fc+l 6 ^ h'k/k T ^k+1 {Vk+l 6 ^ hfc/fc) ) (5) 

hfe+i/fc+i = (1 — A'fc+i) Tfc+i/fc, (6) 
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with 


Kk+i 

hfc+i/fc 


k-\-l/k 


hfe+i/fc + -f- 




2A, 


(1 


,-2Au<5 


)■ 


1.2.2. Stationary limit and continuous time representation. Looking 
for Tk+i/k+i = Tfc/fc (which corresponds to the steady-state regime), 
we hnd: 

“ “ 2e-2^M5 ’ 

where / (a^, A^, (l - , 

I 2CT^CT^ 

and ^ (cTs, A^, cr^) = J f (erg, A^, cr^)^ + ^ 

Using the stationary covariance error Loo, a stationary gain iL^o is 
dehned and the estimate can be rewritten as a corrected exponential 
average: 

OO 

hn+i = K^Y1 

i=0 

The steady-state Kalman hlter has also a continuous time limit de¬ 
pending on the asset returns: 


Proposition 1. 


dS 

d% = -X^f3 (A^, cr^, 0 - 5 ) Jitdt + A^ (/3 (A^, a^, as) - 1 ) 


where 


(3 {\^,a^,as) = 1 + 


a:. 




Proof. Based on [19], the Kalman hlter is given by: 

P (k) dS^ 


E[pt\J^f] = (p{t)(fio + \ 
V ^5 

(j){t) = e "S 


Iq (j) (m) Su 


( 8 ) 

(9) 


where the estimation error variance P is the solution of the following 
Riccati equation: 

P'(t) = rlpkf.2\P(t)+al. 
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In this steady-state regime, we have P' (t) = 0. Then, the positive 
solution of this equation is given by: 

= alXf, {13 (A^, as) - 1), 

and the steady-state Kalman filter follows: 


(t) 


— (^) ( ho H-2 


dS,, 


(^S Jo 0°° (w) Su 




Since: 

d(p°° (t) 

0 OO ~ [Xfi, o'/i, 0's) dt, 

the steady-state Kalman hlter satishes the stochastic differential equa¬ 
tion (j^. □ 

This continuous representation can be used for risk return analysis 
of trend following strategies (see for details). The Kalman hlter 
is the optimal estimator for linear systems with Gaussian uncertainty. 
In practice, the parameters 6 = (A^, are unknown and must be 
estimated. 


2. Inference oe the trend parameters 


In this section, the problem of the parameters inference is treated. 
Based on discrete time observations, two classes of methods can be 
considered. The hrst one is based on backtesting. Each set of parame¬ 
ters dehnes one trend estimator and can be applied to several trading 
strategies. Backtests can be used in order to choose the parameters but 
even if several in- and out-of-sample periods are used, it will not ensure 
a good ht of the model. An alternative and more rigorous way exists: 
the use of statistical estimators. For example. Maximum Likelihood 
and Bayesian estimators have good properties (like consistency). To 
this end, this second approach is considered. For the model (3)-(4) 
Peter Lakner 


(see [H]) and Ofer Zeitouni (see m) develop methods 
based on the algorithm EM in order to attain the maximum likeli¬ 
hood estimator but Fabien Campillo and Franois Le Gland suggest 
that direct maximization should be preferred in some cases (see [S] for 
details). Since the Bayesian estimators are also based on the likelihood, 
we present two on-line computations of this function. Using the Valen¬ 
tine Genon-Gatalot’s results (see na for details), we close this section 
by analysing the asymptotic behaviours of statistical estimators and 
by providing the Gramer Rao bound in closed form. 


2.1. Likelihood Computation. The likelihood can be computed us¬ 
ing two methods. The hrst one is based on a direct calculus while the 
second method uses the Kalman hlter. 
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2 .1.1. Direct computation of the likelihood. A first approach is to di¬ 
rectly compute the likelihood. The vectorial representation of the dis¬ 
crete time model (3)-(4) is: 


Vi 

Vn 

\T 


hi 


Ml 


+ 


/ \ Un 

\T 


where (pi, • • • , /iat) and (wi, • • • , Un) , knowing 6 = (cr^, A^), are two 
independent Gaussian processes. Therefore the vector (i/i, • • ■ .Hn) ^ 
knowing 9, is also a Gaussian process. The likelihood is then charac¬ 
terized by the mean and the covariance 

( 10 ) 
( 11 ) 


^yi-N\d = 0 (ho = 0 is supposed) , 


ui:N\e = and = (Con (p*, Since the 


where S 

drift /i is an Ornstein Uhlenbeck process, then: 

,2 


Com (/it,/i,) = 
Finally, the likelihood is given by: 


_ ^ /^2X^sAt 


!)■ 


f {yi,...yN\0) = 






( 12 ) 


(13) 


Remark 2.1. When the dimension N is large, it is extremely diffi¬ 
cult to directly invert the covariance matrix and to compute the 

determinant of this matrix. An iterative approach can he used instead 
and the details of which are given in Appendix [5[ 

2.1.2. Computation of the likelihood using the Kalman filter. The like¬ 
lihood can also be evaluated via the prediction error decomposition 
(see [31] for details): 

/ (i/l, ...i/Tvl^) = / (i/Tvbl, ■••2/7V-1, 6 ')/(//!,...//Ar_i 10 ) . 

N 

= Ylf {yn\yi,-yn-u0), 

n=l 

and the conditional laws are given by the following proposition: 
Proposition 2. The process {yn\yi, ...yn-iyd) is a Gaussian: 

{yn\yi,-yn-i,0) ~ V , 


^yn\n-l ^ hn-l/n-l5 


= e 


-2X5 


Tn-l/n-l + ^ 
6 


..-2X5 


)K-f. 


and 






The a posteriori estimate of the trend jln-i/n-i the covariance 
error Vn-i/n-i are given by Kalman filtering (see Eguations ^ and 


Proof. Since the process Pn is a Gaussian, the process {ynlvi, ■■■Vn-i, d) 
is also Gaussian. Moreover, using Equations (3) — (4), we have: 


and 


M 


yn\n — l 


l^nln—1 


^ “1“ 0; 


^n/n-1 + ^ 5 

2 

p _ —2A5p I ^r- (1 —2A(5\ 

t njn—l 6 t Yi—l/n—1 “T G C J . 


□ 


Remark 2.2. In practice, the volatility is not constant. However, if 
the volatility as is -adapted, the two methods can he adapted and im¬ 
plemented. This assumption is satisfied if the volatility is a continuous 
time process. 

2.2. Performance of statistical estimators. In this sub-section, 
the asymptotic behaviour of the classical estimators is investigated. 


2.2.1. Asymptotic behaviour of statistical estimator. The discrete time 


model (3)-(4) can be reformulated using the following proposition (see 
for details): 


Proposition 3. Consider the model (3)-(4) with {X^,a^,as) G x 
X In this case, the process (j/i) is ARMA{1, 1 ). 


The asymptotic behaviour of the classical estimators follows. In¬ 
deed, the identihability property and the asymptotic normality of the 
maximum likelihood estimator are well known in stationary ARMA 
Gaussian processes (see [U], section 10.8). Moreover, the asymptotic be¬ 
haviour of the Bayesian estimators are also guaranteed by the ARMA(1,1) 
property of the process (j/j). If the prior density function is continu¬ 
ous and positive in an open neighbourhood of the real parameters, the 
Bayesian estimators are asymptotically normal (see [32] in which a gen¬ 
eralized Bernstein Von Mises theorem for stationary ’’short memory” 
processes is given, or 123 for a discussion on the Bayesian analysis of 
ARMA processes). 


2.2.2. Cramer Rao bound. This bound is the lowest variance of the 
unbiased estimators. The following corollary of the Gramer Rao bound 
Theorem gives a formal description of the GRB. 
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Corollary 2.3. Consider the model (3)-(4) and N observations (yi, • • • 

, Vn)^■ Suppose that (A^, cr^, as) G x x If 9j^ is an unbiased 
estimator of 9 = (A^,(T^), we have: 

Cove (Gn) > CRB (9). 

This bound is given by CRB (9) = (9), where (9) is the Fisher 

Information matrix: 


(In (e)),j = -E 




d9id9j 


and In {9) = NR (9). Moreover, the maximum likelihood estimator 


9ff^ attains this bound: 


Vn( 9^^-9') ^M{0,If\9)). 

This result is a consequence of Proposition]^ (see [6], section 10.8). 
We can also provide an analytic representation of the Fisher informa¬ 
tion matrix: 


Theorem 2.4. For the model (3)-(4), if (A^, cr^, as) € IP+ x x 
we have: 


= (iS /_„ N (^) ^ (--) (--) d^) 

where fe is the spectral density of the process {yi): 




Un = 


2Xa 


(1 _ e-2A.-5^ + # (1 + - 




COS (cj) 


1 -f cos (u) 


Proof. The Whittle’s formula (see [33] for details) gives the integral 
representation of the Fisher information matrix. Since the process (?/*) 
is ARMA(1,1), the expression of its spectral density follows (see [6|, 
section 4.4). □ 

Finally, the Cramer Rao Bound of the trend parameters can be com¬ 


puted using Theorem 2.4 


3. Impact of parameters mis-specification 

In this section, we consider the continuous time Kalman hlter with 
a bad calibration in the steady-state regime. First, we characterize 
the law of the residuals between the hlter (mis-specihed or not) and 
the hidden process. Finally, we study the impact of parameters mis- 
specihcation on the detection of a positive trend. 
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3.1. Context. Suppose that the risky asset S is given by the model 


(l)-(2) with 9* = (cr*, A*), and suppose that an agent thinks that the 


parameters are equal to6 = (cr^, A^). Assuming the steady-state regime 
and using these estimates and Proposition the agent implements the 
continuous time mis-specihed Kalman hlter: 


d-fit = + X^{(3- 1) 


(14) 


where jd = jd (A^,cr^,(j 5 ) (see Equation (|^) and /tq = 0. The following 
lemma gives the law of the mis-specihed Kalman hlter: 


Lemma 3.1. Consider the model (l)-(2) with 9* = In this 

case, the mis-specified continuous time filter of Equation (I 4 ) is given 
by: 

= A^ (/? - 1) e^>^^^iilds + ^s ^ . (15) 

Moreover, fi is a centered Gaussian process and its variance is given 
by: 


Yar [fit] = E [fi[] = 


xiifi-iY{a;y 

Xf (A^/3 - A;) 

2g-(V/3+A;)t _ g-2A;t _ g-2A^/3t g 
+-^-+ - 


Xufd + X* 


tih ' ' 'fi 

-2A^/3t _ ^ 


- K 

Xf, {(d-lfal 


2X,(d 


2(d 


(1 


e-2A,. 


/3t) _ 


Proof. Applying Ito’s lemma to the function / (p, t) = e^'^^^fit, and 
integrating from 0 to t, Equation (15) follows. Therefore, fi is also 
a Gaussian process. Its mean is zero (because /Xg = 0). Since the 
processes p,* and are supposed to be independent, the variance of 
fi is given by the sum of the variances of the terms in Equation (15). 
Moreover: 


Yar 

Yar 


eYhdWy 


IJO 


g2A^/3t _ 


2X.p 




L./0 


'0 JO 



dsids2, 


and Con given by Equation (12). The variance of the pro¬ 
cess fit follows. □ 


3.2. Filtering with parameters mis-specification. The impact of 
parameters mis-specihcation on trend hltering can be measured using 
the diherence between the hlter and the hidden process. The following 
theorem gives the law of the residuals. 
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Theorem 3.2. Consider the model 


(l)-(2) with 9* = (a*, A*) and the 


trend estimate defined in Equation In this case, the process fi—ja 
is a centered Gaussian process and its variance has a stationary limit: 

JkaVar [ft - = g (a„ (/? - 1)" + X; ({fi-f - l) .(16) 


where fi = fi (A^, as) and fi* = fi (A*, a*, as) (see Equation 
Moreover, if = (cr*,A*), Equation (16) becomes: 

lim Yar [fi; - /i*] = X*al (fi* - 1). 

t^oo ^ 


(17) 


Proof. Using Equation (15), it follows that the process (1 — p* is a 
centered Gaussian process. The variance of this difference can be com¬ 
puted in closed form: 


Yar [pt - p*t\ = Yar \fit] + Yar [pf\ - 2 * Cov [pt,p*t \, 
where Yar [pf\ = (l — e~‘^^*Y), and Yar \flf\ is given by Lemma 


3.1[ Since the processes and p* are supposed to be independent, 

A^(/)-i)P;)- 


we have: 


C0Y[pt,pl\ = 




2A* y X^fi + X* 

3~2A*t _ 


- K 

The asymptotic variance is obtained by tending t to inhnity: 


lim Yar [pt — /i^ ] = 

t^OO 


K ifi - 1 ) 

2/3 


(/3 - 1 ) 4 


w +1) 


2Ar 


and Equation (16) follows. Finally, Equation is obtained by tend¬ 
ing 6* to 6**. □ 


Remark 3.3. Consider the well-specified case (a^, A^) = (a*, A*). Us¬ 
ing Equation 0. it follows that: 


hm 

t-s-oo Yar [p^] 



(18) 


Then, the asymptotic relative variance of the well-specified residuals is 
an increasing function of X*^ and a decreasing function of af. 
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3.3. Detection of a positive trend. In practice, the trend estimate 
(mis-specified or not) can be used for an investment decision. For 
example, a positive estimate leads to a long position. So, it is inter¬ 
esting to know the probability to have a positive trend, knowing a 
positive estimate. We derive this probability in closed form. The fol¬ 
lowing proposition gives the asymptotic conditional law of the trend 
(htiht = x)- 


with 6* = (cr*, A*) and the 
trend estimate defined in Equation In this case: 


Proposition 4. Consider the model (l)-(2) 


= x) 4 M {I 

t —>00 


.,Var~ 






). 


(19) 


with: 



Yar 


00 

u*\h 


Ay ((p-f - 1) 

iP-l) + 


Var“ 


1 - 


(a; + Ay) (Ay + x; (p-f) 


( 20 ) 

( 21 ) 


where YarP — 

U 2\* 


Moreover, i/(a^, A^) = (a*. A*), Equation (19) becomes: 


(p*|/i*=a;) 4 U 

t^OO 


X. 


2Y arY 


( 22 ) 


’ P' + 1 

where fi* = fi (A*,cr*,cr 5 ) (see Equation ^). 

Proof. Since the estimate fi and the trend fi* are two centred and cor¬ 
related Gaussian processes (see Lemma 3.1 and the proof of Theorem 


3.2[ ), the conditional law {Et\Pt = x) is Gaussian with a mean and a 

Con 


variance given by: 


VtlAt 


Yar [fit 


-X, 


= Yar [/r*] 


Con {fit,ti*t] 
Yar [fit] 


Using Lemma 3.1 and the expression of C.ov{fit,El) the proof of 
Theorem 13.21 


lim ! 

t^OO 


JhnVar^*!^, = Var“|^, 


and Equation (19) follows. Moreover, Equation (22) is obtained by 
tending 6* to 6**. □ 
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The following proposition is a consequence of the previous propo¬ 
sition. It gives the asymptotic probability to have a positive trend, 
knowing a positive estimate equal to x. 


Proposition 5. Consider the model\(l)-(2)\ with 6* = (a*, A*) and the 


trend estimate defined in Equation In this case: 


lim P {fi* > 0\fit = x) = Poo ifi* > 0|/i = x) , (23) 

t^OO 


where 


Poo (/i* > 0 \fi = x) 


1 - $ 





(24) 


where and are defined in Equations (20) and (21), 

and <h is the cumulative distribution function of the standard normal 
law. 

Moreover, if x > 0 and (a^, A^) = (a*, A*), this asymptotic probabil¬ 
ity becomes an increasing function of a* and a decreasing function of 

A?,. 


Proof. Equations (23) and (24) follow from Proposition Now, con¬ 
sider the well-specified case (a^, A^) = (a*. A*) and x > 0. Using 


Equation (22), it follows that: 


= / (a;, a;, as) , 


where 


/ (<. K’ '^s) = 


cr 


Since 


dX* 


da* 






cy 


( a ;)" [i + ji + ffifi + cy 








a: 


0 


— > 0 , 


< 0 , 


the asymptotic well-specihed probability to have a positive trend, know¬ 
ing a positive estimate equal to x is an increasing function of a* and a 
decreasing function of A*. □ 
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Remark 3.4. This probability is an increasing function of x. Indeed, 
it is easier to detect the sign of the real trend with a high estimate than 
with a low estimate. Moreover, this probability is always superior to 
0.5. This is due to the non-zero correlation between the trend and the 
filter. ^45 shown in the previous sections, trend filtering is easier with 
a small spot volatility. Here, the probability to make a good detection 
is also a decreasing function of the as. 


4. Simulations 

In this section, nnmerical examples are compnted in order to make 
the reader aware of the trend hltering problem. First, the feasibility 
of trend forecasting with statistical estimator is illustrated on different 
trend regimes. Then, the effects of a bad forecast on trend hltering and 
on the detection of a positive trend are also discussed. 


4.1. Feasibility of trend forecasting. Suppose that only discrete 
time observations are available and that the discrete time step is equal 
to 5 = 1/252. In this case, the agent uses the daily returns of the risky 
asset to calibrate the trend. We also assume that the agent uses an 
unbiased estimator. Given T years of observations. The Cramer Rao 
Bound is given by: 


CRBt {9) 


id) 

T*252’ 


where R (9) is given by Theorem 2.4 The smallest conhdence region is 


obtained with this matrix. In practice, the real values of the parameters 
9 are unknown and asymptotic conhdence regions are computed (re¬ 
placing 9 by the estimates 9 in the Fisher information matrix R ( 9 


Since the goal of this subsection is to evaluated the feasibility of this 
problem, we suppose that we know the real values of the parameters. 
Then the real Cramer Rao Bound can be computed. Suppose that 
a target standard deviation Xi is hxed for the parameter 9i. In this 
case, to reach the precision Xi, the length of the observations must be 
superior to: 


in' w)u 

252 ' 


We consider a hxed spot volatility as = 30%, two target precisions 
for each parameter 9i and we compute Tf for several conhgurations. 
The hgures BiB and 1^ represent the results. It is well known that 
for a high measurement noise, which means a high spot volatility, the 
problem is harder because of a low signal-to-noise ratio. The higher 
the volatility, the longer the observations must be. Here, we observe 
that with a higher drift volatility a^ and a lower A^, the problem is 
easier. Indeed, the drift takes higher values and is more detectable. 
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Moreover, the simulations show that the classical estimators are not 
adapted to a so weak signal-to-noise ratio. Even after a long period of 
observations, the estimators exhibit high variances. Indeed the smallest 
period of observations is superior to 29 years. It corresponds to a target 
standard deviation equal to 0.5 for a real parameter = 1 and a trend 
standard deviation equal to ~ 63%. Therefore, for this 

conhguration, after 30 years of observations, the standard deviation 
is equal to 50% of the real parameter value A^. After 742 years, this 
standard deviation is equal to 10%. Even with this kind of regime, the 
trend forecast with a good precision is impossible. 


Time to reach stdCa^) = 0.05 



Figure 1. Time to reach a target standard deviation 
on (j^ equal to 0.05 (In(years)) 
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Time to reach stdCa^) = 0.01 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 


Figure 2. Time to reach a target standard deviation 
on equal to 0.01 (In(years)) 


Time to reach std{^) = 0.5 



"oi 02 03 04 05 06 07 08 Os” 


Figure 3. Time to reach a target standard deviation 
on equal to 0.5 (In(years)) 


15 




Time to reach std{A.) = 0.1 



"^1 02 03 04 05 06 07 08 09 


Figure 4. Time to reach a target standard deviation 
on eqnal to 0.1 (In(years)) 


4.2. Impact of parameters mis-specification on trend filtering. 

This snbsection illustrates the impact of parameters mis-specihcation 
on trend hltering. Using the results of Theorem 3.2[ we represent, for 
different conhgurations, and for the well- and mis- specihed case, the 
asymptotic standard deviation of the residuals between the trend and 
the hlter. The hgures and represent the asymptotic standard de¬ 
viation of the trend and of the residuals in the well-specihed case (the 
agent uses the real values of the parameters) for different conhgura¬ 
tions. As seen in Equation ( pT| , the asymptotic standard deviation of 
the well-specihed residuals is an increasing function of the drift volatil¬ 
ity cr* and a decreasing function of the parameter A*. For A* = 1 and 
(J* = 90%, the standard deviation of the residuals (~ 44%) is inferior 


For a high A* and a 


to the standard deviation of the trend (~ 64%) 
small drift volatility, the two quantities are approximately equal. This 
hgure leads to the same conclusions than Equation (18). Indeed, like 
the calibration problem, the problem of trend hltering is easier with a 
small A* and a high drift volatility cr*. 

Now consider the worst conhguration as = 30%, A* = 5 and cr* = 
10%. The hgure represents the asymptotic standard deviation of the 




This regime corresponds to a 

,- 1/2 


residuals for diherent estimates (A 

standard deviation of the trend equal to a* (2A*) ^ 3.2% and to a 

standard deviation of the residuals equal to 3.16% in the well-specihed 
case. If the agent implements the Kalman hlter with A^ = 1 and 
cr^ = 90%, the standard deviation of the residuals becomes superior to 
25%. Finally, consider the best conhguration as = 30%, A* = 1 and 


c, = 


90%. The hgure [^represents the asymptotic standard deviation of 
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the residuals for different estimates This regime corresponds 

to a trend standard deviation equal to a* (2A*) ~ 63% and to a 

standard deviation of the residuals equal to 44% in the well-specihed 
case. If the agent implements the Kalman hlter with = 5 and 
= 10%, the standard deviation of the residuals becomes superior 
to 60%. Even with a good regime, the impact of parameters mis- 
specihcation on trend hltering is not negligible. 
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Figure 5. Asymptotic standard deviation of the trend 
as a function of the trend parameters with as = 30%. 
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Figure 6. Asymptotic standard deviation of the resid¬ 
uals of the well-specihed Kalman hlter as a function of 
the trend parameters with as = 30%. 
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Figure 7. Asymptotic standard deviation of the resid- 
nals of the mis-specihed Kalman hlter as a fnnction of 
the trend estimate parameters with as = 30%, A* = 5 
and a*. = 10%. 
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Figure 8 . Asymptotic standard deviation of the resid- 
nals of the mis-specihed Kalman hlter as a fnnction of 
the trend estimate parameters with as = 30%, A* = 1 
and a* = 90%. 


4.3. Detection of a positive trend. In this snbsection, nsing Eqna- 
tion (24), the asymptotic probability to have a positive trend, knowing 
a trend estimate eqnal to a threshold x is illustrated. In order to com¬ 
pare this probability for diherent trend regimes, we choose a threshold 
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equal to the standard deviation of the hlter /i. First, this quantity is 
tractable in practice. Moreover, since the continuous time mis-specihed 
hlter p is a centered Gaussian process, the probability that jl becomes 
superior (or inferior) to its standard deviation is independent of the 
parameters (a*. A*, a^, A^, US') . First, suppose that the agent uses the 
real values of the parameters and consider the asymptotic probability 
P (p* > 0|/i* = to have a positive trend, knowing an estimate 

equal to its standard deviation. The hgure represents this probabil¬ 
ity for different regimes. As seen in Proposition in the well-specihed 
case, this probability is an increasing function of the trend volatility 
cr* and a decreasing function of A*. Like the calibration and the hlter- 
ing problem, the detection is easier with a small A* and a high drift 
volatility. Now, suppose that the agent uses wrong estimates (cr^, A^). 
In this case, the agent implements the continuous time mis-specihed 
Kalman hlter. The hgures and 11 represent the asymptotic prob¬ 


ability P [fi* > 0\ft = \/^~^) for the best and the worst conhguration 
of the hgure 1^ As explained in Remark 3^, this probability is always 
superior to 0.5, even with a bad calibration of the parameters. For 
each case, the probability to have a positive trend, knowing an esti¬ 
mate equal to its standard deviation does not vary a lot with an error 
on the parameters. This quantity seems to be robust to parameters 
mis-specihcations. 
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Figure 9. Asymptotic probability to have a positive 
trend given a well-specihed estimate equal to its standard 
deviation with as = 30%. 
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Figure 10. Asymptotic probability to have a positive 
trend given a mis-specified estimate equal to its standard 
deviation with as = 30%, A* = 1 and a* = 90%. 
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Figure 11. Asymptotic probability to have a positive 
trend given a mis-specihed estimate equal to its standard 
deviation with as = 30%, A* = 5 and a* = 10%. 
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5. Conclusion 


The present work tries to illustrate the difficulty of trend filtering 
with a model based on an unobserved mean-reverting diffusion. This 
model belongs to the class of Linear Gaussian Space State models. 
The advantage of this kind of system is to have an on-line method of 
estimation: the Kalman filter. 

In practice, the parameters of the model are unknown, then the cali¬ 
bration of filtering parameters is crucial. The linear and Gaussian case 
allows to compute, in closed form, the likelihood. The Kalman filter 
can also be used for this calculus. These methods can be generalized 
to a non-constant volatility and classical estimators can be easily put 
in practice. 

Although this framework is particularly convenient for forecasting, 
the results of the analysis show that the classical estimators are not 
adapted to a so weak signal-to-noise ratio. Moreover, the linear and 
Gaussian model has deficiencies. In practice, hnancial asset returns are 
heavy-tailed (see 1231 ) because of jumps, volatility fluctuations. So, the 
stationarity is not guaranteed on a long horizon of observations. Then, 
the problem of trend hltering with real data is harder than with the 
Linear and Gaussian framework. 

With this simple model, the horizons of observations needed for a 
acceptable precision are too long. Therefore, the convergence is not 
guaranteed and the impact of mis-specification on trend hltering is not 
negligible. We surely conclude that it is impossible to estimate the 
trend with a good precision. Despite these difficulties, the non-zero 
correlation between the trend and the estimate (mis-specihed or not) 
can be used for the detection of a positive (or negative) trend. 
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Appendix A: discrete Kalman filter 

Framework. This section is based on im. The discrete Kalman hlter 
is a recursive method. Consider two objects: the observations {yk} and 
the states of the system {xk}- This hlter is based on a Gauss-Markov 
hrst order model. Consider the following system: 

^k+l FkXk T Tfc, 

Vk = HkXk + Uk- 

The hrst equation is an a priori model, the transition equation of the 
system. The matrix Fk is the transition matrix and Vk is the transition 
noise. The second equation is the measurement equation. The matrix 
Hk is named the measurement matrix and Uk is the measurement noise. 
The aim is to identify the underlying process {xk}- The two noises are 
supposed white, Gaussian, centered and decorrelated. In particular: 



The two noises are also supposed independent of Xk and the initial 
state is Gaussian. So, it can be proved with a recurrence that all states 
are Gaussian. Therefore, just the mean and the covariance matrix are 
needed for the characterization of the state. The estimation is given 
by two steps. The hrst one is an a priori estimation given Xk+i/k 
and Tk+i/k = E [(x^+i - Xk+i/k){xk+i - Xk+i/kV]- When the new ob¬ 
servation is available, a correction of the estimation is done to ob¬ 
tain Xk-\-l/k+l Sud '^k+l/k+1 E Xk-\-\/k+l){,^k+l ^k+l/k+l) ]• 

This is the a posteriori estimation. The criterion considered for the 
a posteriori estimation is the least squares method, which corresponds 
to the minimization of the trace of Tk+i/k+i- 

Filter. The prediction (a priori estimation) is given by 

i^k+l/k F'k^k/ki 

Efc+i/fc = FkTk/kFk+Rk- 

The a posteriori estimation is a correction of the a priori estimation. 
A gain is introduced to do this correction: 

Sjk+l/k+l S^k+l/k T Kk+1 ijJk+1 FIk-\-\Xk-\-l/k^ • 

As explained above, the gain Kk+i is found by least squares method, 
which corresponds to 

dtrace{Tk+i/k+i) _ 

dKk+i “ 

With the classical lemma of derivation for matrix, the gain is found: 

A'fc+i = Tk+i/kHl^i[Hk+iF'k+i/kFlk+i +FFI^^ , 

Efc+l/fc+l Kk+lFUk+l) Fk+l/k- 
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Appendix B: Iterative methods for the inverse and the 

DETERMINANT OF THE COVARIANCE MATRIX 

In this appendix, we provide iterative methods for the inverse and 
the determinant of the covariance matrix. 


Inverse of the covariance matrix. The nse of the Matrix Inversion 
Lemma on Eqnation (10) gives: 


v-i _ v-i _ v-i /i-iv-i 

yi:N\d Ail:Ar|0 IJ-1:n\6 V /il;jv|0’ 


where = -^In + Then, we have to compnte the inverse of 

the matrices and 


Inverse of the matrix A^. Snppose that is compnted. The matrix 
Ajv+i can be broken into fonr snb-matrices: 


where 


An+i — 


Bi \ 
Bs B, 


6 2A^ 

^ (eM'5 — ’ 

° ■■■ 

Bs = 

Bi = Atv- 


Therefore, the matrix A^r+i can be inverted blockwise. 


Inverse of the matrix The following lemma is used (see pQ for 

details): 


Lemma 5.1. Let fi he an Ornstein Uhlenbeck process with parameters 
9 = (A^,(T^). The covariance matrix o/pi ,/iat is Then: 





-1 

0 



0 

-1 

—A^(5 

-1 




• • o 

-1 

A j / (5 1 — \ / / S 

e +e 

-1 





-1 

gA^(5_|_g —A^(5 

-1 

0 




-1 


-1 

0 



0 

-1 

e'^M' 


\ 


/ 


23 








Therefore, the inverse of the matrix S 


Ml:Ar + lF 


^2(eV^_e-V«) 
-2A^ 


Ml:iV + l 

-2A^ 


f72(eV«-e-V^ 


is given by: 

0 ■■■ 0 \ 




V 


0 


/ 


Procedure. Finally, at time t, the inverse of the covariance matrix is 
given by the following protocol: 

• Computation of the matrix using A^\. 

• Computation of the matrix using ^\g- 

• Using S-U, = I, - S-i 
obtained. 


iflA |„, the matrix S is 
Aii;t|6 ' yi-.tw 


Determinant of the covariance matrix. The iterative computation 
of det is based on the following lemma: 

Lemma 5.2. The determinant of the matrix is given by: 


det 

and for N > 2, we have: 


det (In + 


det (S ^ 


(25) 


det(s ^ |„ 


^ - g{X^,a^){e^i-^+e V«) det(s^l^|g) 


-g(A^,o-^)^ det( S ^ 






where 


9 if^gi 


2A„ 


(j2 _ e-A^5) • 


Proof. The multiplication of Equation (10) by gives: 


a 


S V'”! 




^ yi:N\S' 


Equation (25) follows. Using Lemma 5.1, The matrices 


In + ^^d are tridiagonal. The recursive computa¬ 
tion of their determinant is then possible. □ 
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